Corpus: lat_wikipedia_2012_10K

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 2768 c-
2 2465 p-
3 2251 a-
4 1944 s-
5 1819 C-
Top Character Bigrams
word rank frequency n-gram
1 1432 co-
2 927 in-
3 806 pr-
4 721 re-
5 623 Ca-
Top Character Trigrams
word rank frequency n-gram
1 749 con-
2 417 pro-
3 302 com-
4 302 per-
5 236 pra-
Top Character 4-Grams
word rank frequency n-gram
1 241 cons-
2 224 prae-
3 176 inte-
4 143 comp-
5 117 cont-
Top Character 5-Grams
word rank frequency n-gram
1 119 inter-
2 85 super-
3 77 trans-
4 68 const-
5 59 circu-
588 msec needed at 2017-12-31 10:40